Minimizing Slowdown in Heterogeneous Size-Aware Dispatching Systems (full version)
Authors
Abstract
We consider a system of parallel queues where tasks are assigned (dispatched) to one of the available servers upon arrival. The dispatching decision is based on the full state information, i.e., on the sizes of the new and existing jobs. We are interested in minimizing the mean slowdown, i.e., the mean of the sojourn time divided by the processing time. Assuming no new jobs arrive, the shortest-processing-time-product (SPTP) schedule is known to minimize the slowdown of the existing jobs. The main contribution of this paper is three-fold: 1) to show the optimality of SPTP with respect to slowdown in a single-server queue under Poisson arrivals; 2) to derive the so-called size-aware value functions for M/G/1-FIFO/LIFO/SPTP/SPT/SRPT with general holding costs, of which the slowdown criterion is a special case; and 3) to utilize the value functions to derive efficient dispatching policies that minimize the mean slowdown in a heterogeneous server system. The derived policies offer significantly better performance than, e.g., the size-interval-task-assignment with equal load (SITA-E) and least-work-left (LWL) policies.
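As an illustration of the static case mentioned in the abstract (no new arrivals), the following minimal sketch orders jobs by the product of original size and remaining size and checks, by brute force over all orderings, that this order minimizes the total slowdown. The job representation as `(size, remaining)` pairs, the function names, and the sample numbers are assumptions made for this example and are not taken from the paper. Time already spent in the system is ignored, since it adds the same constant to every ordering.

```python
from itertools import permutations

def total_slowdown(order):
    """Sum of (completion time / original size) when jobs run back to back.

    Each job is a hypothetical (size, remaining) pair: original processing
    time and remaining processing time.
    """
    clock = 0.0
    total = 0.0
    for size, remaining in order:
        clock += remaining          # this job finishes at the current clock
        total += clock / size       # its slowdown contribution
    return total

def sptp_order(jobs):
    """Order jobs by the product size * remaining, smallest first."""
    return sorted(jobs, key=lambda job: job[0] * job[1])

# Illustrative data only.
jobs = [(4.0, 1.0), (1.0, 1.0), (2.0, 0.5), (3.0, 3.0)]

# Brute-force check over all 4! orderings.
best = min(total_slowdown(p) for p in permutations(jobs))
sptp = total_slowdown(sptp_order(jobs))
print(sptp, best)
```

The brute-force comparison is only feasible for a handful of jobs; it serves here as a sanity check of the ordering rule, not as part of any dispatching policy.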
Similar resources
Size- and state-aware dispatching problem with queue-specific job sizes
We consider the dispatching problem in a size- and state-aware multi-queue system with Poisson arrivals and queue-specific job sizes. By size- and state-awareness, we mean that the dispatcher knows the size of an arriving job and the remaining service times of the jobs in each queue. By queue-specific job sizes, we mean that the time to process a job may depend on the chosen server. We focus on min...
An Energy-Aware Workload Dispatching Simulator for Heterogeneous Clusters
This paper presents an energy-aware workload dispatching simulator for heterogeneous clusters. Most clusters in a data center are composed of different kinds of machines. Among these machines, the front-end nodes distribute incoming requests to the back-end workers. The main concern in such systems has traditionally been computation performance, but energy consumption has emerged as an equally ...
Minimizing the worst slowdown: off-line and on-line
Minimizing the slowdown (expected sojourn time divided by job size) is a key concern of fairness in scheduling and queuing problems where job sizes are very heterogeneous. We look for protocols (service disciplines) capping the worst slowdown a job may face no matter how large (or small) the other jobs are. We call this worst slowdown the liability of the job in question. In the scheduling prob...
Estimating Effective Slowdown of Tasks in Energy-Efficient Cloud Systems
Consolidation consists of scheduling multiple virtual machines onto fewer servers in order to improve resource utilization and to reduce operational costs due to power consumption. However, virtualization technologies do not offer performance isolation, causing applications’ slowdown. In this work, we propose a performance enforcing mechanism, composed of a slowdown estimator, and an interferenc...
Decentralized Probabilistic Job Dispatching in Heterogeneous Multiple Computer Systems
In this paper, we study decentralized probabilistic job dispatching and load balancing strategies which optimize the performance of heterogeneous multiple computer systems. We present a model to study a heterogeneous multiple computer system with a decentralized stochastic job dispatching mechanism, where nodes are treated as M/G/1 servers. We discuss a way to implement a virtual centralized jo...